Auto Tuning of Hadoop and Spark parameters
نویسندگان
چکیده
Data of the order terabytes, petabytes, or beyond is known as Big Data. This data cannot be processed using traditional database software, and hence there comes need for Platforms. By combining capabilities features various big applications utilities, Platforms form a single solution. It platform that helps to develop, deploy manage environment. Hadoop Spark are two open-source provided by Apache. Both these platforms have many configurational parameters, which can unforeseen effects on execution time, accuracy, etc. Manual tuning parameters tiresome, automatic ways should needed tune them. After studying analyzing previous works in automating this paper proposes algorithms - Grid Search with Finer Tuning Controlled Random Search. The performance indicator studied Execution Time. These help automatically. Experimental results shown reduction time about 70% 50% 81.19% 77.77% Search, respectively.
منابع مشابه
Pre-stack Kirchhoff Time Migration on Hadoop and Spark
Pre-stack Kirchhoff time migration (PKTM) is one of the most widely used migration algorithms in seismic imaging area. However, PKTM takes considerable time due to its high computational cost, which greatly affects the working efficiency of oil industry. Due to its high fault tolerance and scalability, Hadoop has become the most popular platform for big data processing. To overcome the shortcom...
متن کاملSpark Parameter Tuning via Trial-and-Error
Spark has been established as an attractive platform for big data analysis, since it manages to hide most of the complexities related to parallelism, fault tolerance and cluster setting from developers. However, this comes at the expense of having over 150 configurable parameters, the impact of which cannot be exhaustively examined due to the exponential amount of their combinations. The defaul...
متن کاملTowards Energy Auto-Tuning
Energy efficiency is gaining more and more importance, since well-known ecological reasons lead to rising energy costs. In consequence, energy consumption is now also an important economical criterion. Energy consumption of single hardware resources has been thoroughly optimized for years. Now software becomes the major target of energy optimization. In this paper we introduce an approach calle...
متن کاملPERI Auto-Tuning
The enormous and growing complexity of today's high-end systems has increased the already significant challenges of obtaining high performance on today's equally complex scientific applications. Application scientists are faced with a daunting challenge in tuning their codes to exploit performance-enhancing architectural features. The Performance Engineering Research Institute (PERI) is working...
متن کاملAuto-Tuning Parallel Skeletons
Parallel skeletons are a structured parallel programming abstraction that provide programmers with a predefined set of algorithmic templates that can be combined, nested and parameterized with sequential code to produce complex programs. The implementation of these skeletons is currently a manual process, requiring human expertise to choose suitable implementation parameters that provide good p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International journal of engineering trends and technology
سال: 2021
ISSN: ['2231-5381', '2349-0918']
DOI: https://doi.org/10.14445/22315381/ijett-v69i11p204